Hebrew Vowel Restoration With Neural Networks

نویسندگان

  • M. Spiegel
  • J. Volk
چکیده

Modern Hebrew is written without vowels, presenting a problem for those wishing to carry out lexical analysis on Hebrew texts. Although fluent speakers can easily replace vowels when reading or speaking from a text, there are no simple rules that would allow for this task to be easily automated. Previous work in this field has involved using statistical methods to try to solve this problem. Instead we use neural networks, in which letter and morphology information are fed into a network as input and the output is the proposed vowel placement. Using a publicly available Hebrew corpus containing vowel and morphological tagging, we were able to restore 85% of the correct vowels to our test set. We achieved an 87% success rate for restoring the correct phonetic value for each letter. While our results do not compare favorably to previous results, we believe that, with further experimentation, our connectionist approach could be made viable.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An HMM Approach to Vowel Restoration in Arabic and Hebrew

Semitic languages pose a problem to Natural Language Processing since most of the vowels are omitted from written prose, resulting in considerable ambiguity at the word level. However, while reading text, native speakers can generally vocalize each word based on their familiarity with the lexicon and the context of the word. Methods for vowel restoration in previous work involving morphological...

متن کامل

Many ways to read your vowels - Neural processing of diacritics and vowel letters in Hebrew

The current study examined the effect of orthographic transparency and familiarity on brain mechanisms involved in word recognition in adult Hebrew readers. We compared the effects of diacritics that provide transparent but less familiar information and vowel letters that increase orthographic transparency without compromising familiarity. Brain activation was measured in 18 adults during oral ...

متن کامل

Language Support A Simple Technique for Typesetting Hebrew with Vowel Points

This paper describes a simple mechanism for typesetting Hebrew with vowel points. Hebrew uses a large set of accents that represent vowels, consonant modifiers, and cantillation instructions. These accents are placed above, below, or inside letters; a single letter can carry several accents. The solution that we describe, which is designed for PostScript [2] output devices, leaves the placement...

متن کامل

Vowel reduction in Modern Hebrew: Traces of the past and current variation

The aim of this paper was to find out the scope and boundaries of a-reduction in Modern Hebrew. In Classical Hebrew, vowel reduction was a regular, obligatory process. In Modern Hebrew, it has restricted scope and operates under opaque conditions. The only reliable trace of the historical motivation for the rule is the Hebrew vocalization system (nikud). 100 participants in four age groups were...

متن کامل

Remarks on the Development of Some Pronominal Suffixes in Hebrew

The paradigmatic pressure for the preservation of the final vowels of pronominal suffixes after long vowels, where gender opposition could not be marked by the preceding vowel, was strong enough to create in rabbinic Hebrew, in Aramaic, and in Arabic, dialect doublets, viz., suffixes without final vowel after originally short vowels (as rabbinic Hebrew yadak 'your hand'), and those with final v...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005